FrogWild! - Fast PageRank Approximations on Graph Engines

نویسندگان

  • Ioannis Mitliagkas
  • Michael Borokhovich
  • Alexandros G. Dimakis
  • Constantine Caramanis
چکیده

We propose FrogWild, a novel algorithm for fast approxi-mation of high PageRank vertices, geared towards reducingnetwork costs of running traditional PageRank algorithms.Our algorithm can be seen as a quantized version of poweriteration that performs multiple parallel random walks overa directed graph. One important innovation is that we in-troduce a modification to the GraphLab framework thatonly partially synchronizes mirror vertices. This partialsynchronization vastly reduces the network traffic generatedby traditional PageRank algorithms, thus greatly reducingthe per-iteration cost of PageRank. On the other hand,this partial synchronization also creates dependencies be-tween the random walks used to estimate PageRank. Ourmain theoretical innovation is the analysis of the correla-tions introduced by this partial synchronization process anda bound establishing that our approximation is close to thetrue PageRank vector.We implement our algorithm in GraphLab and compareit against the default PageRank implementation. We showthat our algorithm is very fast, performing each iteration inless than one second on the Twitter graph and can be up to7× faster compared to the standard GraphLab PageRankimplementation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting Fame and Fortune: PageRank or Indegree?

Measures based on the Link Recommendation Assumption are hypothesised to help modern Web search engines rank ‘important, high quality’ pages ahead of relevant but less valuable pages and to reject ‘spam’. We tested these hypotheses using inlink counts and PageRank scores readily obtainable from search engines Google and Fast. We found that the average Google-reported PageRank of websites operat...

متن کامل

EmptyHeaded: Boolean Algebra Based Graph Processing

There are two types of high-performance graph processing engines: lowand high-level engines. Low-level engines (Galois, PowerGraph, Snap) provide optimized data structures and computation models but require users to write low-level imperative code; hence ensuring that efficiency is the burden of the user. In high-level engines users write in query languages like datalog (SociaLite) or SQL (Grai...

متن کامل

Approximating Personalized PageRank with Minimal Use of Web Graph Data

In this paper, we consider the problem of calculating fast and accurate approximations to the personalized PageRank score ([8, 16]) of a webpage. We focus on techniques to improve speed by limiting the amount of webgraph data we need to access. PageRank scores are mainly used for ranking purposes, and generally only the scores exceeding a given threshold are relevant. In practice, and relative ...

متن کامل

On Local Estimations of PageRank: A Mean Field Approach

PageRank is a key element in the success of search engines, allowing to rank the most important hits in the top screen of results. One key aspect that distinguishes PageRank from other prestige measures such as in-degree is its global nature. From the information provider perspective, this makes it difficult or impossible to predict how their pages will be ranked. Consequently a market has emer...

متن کامل

Local Approximation of Centrality Measures

Centrality measures provide a means to differentiate the importance of vertices in a network. These measures are mathematically clear, but the algorithms to compute them often have quadratic time complexity or worse. This may lead to significant computational challenges when applied to large networks. In this paper, we propose a local strategy for three frequently used centrality measures: (i) ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • PVLDB

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2015